Prizm Content Connect
Performing Auto-Redaction

An auto-redaction is a multi-step process that finds matches of a given regular expression, then permanently removes the text and blacks out the displayed region for each match. The end result is a new PDF document with no traces of the text that matched the regular expression.

The following steps will walk you through the auto-redaction process using the PCC RESTful Web Services.

Step 1: Upload Your Source Document

Example
Copy Code
POST http://192.168.0.1:18681/PCCIS/V1/WorkFile?FileExtension=pdf
Content-Type: application/octet-stream
[binary data]

200 OK
Content-Type: application/json
{
    "fileId": "5qTYa3gzN9gYUb5SzqUhqg",
}

Step 2: Compose a Regular Expression

Example
Copy Code
"[0-9]{3}[-]?[0-9]{2}[-]?[0-9]{4}"

Step 3: Create Markup XML from the Regular Expression

Before the actual redaction process can be started, the regular expression needs to be converted to a format it can understand. PCC uses a proprietary XML syntax to define markups used for redaction, which you can generate by sending a POST request which requires two inputs:

Example
Copy Code
POST http://192.168.0.1:18681/PCCIS/V1/RedactionCreator
Content-Type: application/json
{
    "input": {
        "documentFileId": "5qTYa3gzN9gYUb5SzqUhqg", 
        "autoRedactionRegularExpressions": ["[0-9]{3}[-]?[0-9]{2}[-]?[0-9]{4}"]
    }
}

200 OK
Content-Type: application/json
{
    "processId": "Rr64ma-U_HseoPrs6y0iiw",
    "expirationDateTime": "2014-12-03T18:30:49.460Z",
    "input": {
        "documentFileId": "5qTYa3gzN9gYUb5SzqUhqg",
        "autoRedactionRegularExpressions": ["[0-9]{3}[-]?[0-9]{2}[-]?[0-9]{4}"]
    },
    "state": "processing",
    "percentComplete": 0
}

Step 4: Check Status of the RedactionCreator Resource

Example
Copy Code
GET http://192.168.0.1:18681/PCCIS/V1/RedactionCreator/Rr64ma-U_HseoPrs6y0iiw

200 OK
Content-Type: application/json
{
    "processId": "Rr64ma-U_HseoPrs6y0iiw",
    "expirationDateTime": "2014-12-03T18:30:49.460Z",
    "input": {
        "documentFileId": "5qTYa3gzN9gYUb5SzqUhqg",
        "autoRedactionRegularExpressions": ["[0-9]{3}[-]?[0-9]{2}[-]?[0-9]{4}"]
    },
    "state": "complete",
    "percentComplete": 100,
    "output": {
        "markupFileId": "o1bLJwFGxf9QGuTkyrOqig"
    }
}

Step 5: Start the Markup Burning Process (Redaction)

Example
Copy Code
POST http://192.168.0.1:18681/PCCIS/V1/MarkupBurner
Content-Type: application/json
{
    "input": {
        "documentFileId": "5qTYa3gzN9gYUb5SzqUhqg", 
        "markupFileId": " o1bLJwFGxf9QGuTkyrOqig"
    }
}

200 OK
Content-Type: application/json
{
    "processId": "bQpcuixhvGmNqn5ElskO6Q",
    "expirationDateTime": "2014-12-03T18:30:49.460Z",
    "input": {
        "documentFileId": "5qTYa3gzN9gYUb5SzqUhqg",
        "markupFileId": " o1bLJwFGxf9QGuTkyrOqig"
    },
    "state": "processing",
    "percentComplete": 0
}

Step 6: Check Status of the MarkupBurner Resource

Example
Copy Code
GET http://192.168.0.1:18681/PCCIS/V1/MarkupBurner/ bQpcuixhvGmNqn5ElskO6Q

200 OK
Content-Type: application/json
{
    "processId": " bQpcuixhvGmNqn5ElskO6Q ",
    "expirationDateTime": "2014-12-03T18:30:49.460Z",
    "input": {
        "documentFileId": "5qTYa3gzN9gYUb5SzqUhqg",
        "markupFileId": " o1bLJwFGxf9QGuTkyrOqig"
    },
    "state": "complete",
    "percentComplete": 100,
    "output": {
        "documentFileId": "5ufb3ytUb1BxxgSUAk_G9Q"
    }
}

Step 7: Download the Redacted Document

Example
Copy Code
GET http://192.168.0.1:18681/PCCIS/V1/WorkFile/5ufb3ytUb1BxxgSUAk_G9Q

200 OK
Content-Type: application/pdf
[binary data]

 

 


©2014. Accusoft Corporation. All Rights Reserved.

Send Feedback